Sanskrit Compound Processor
نویسندگان
چکیده
Sanskrit is very rich in compound formation. Typically a compound does not code the relation between its components explicitly. To understand the meaning of a compound, it is necessary to identify its components, discover the relations between them and finally generate a paraphrase of the compound. In this paper, we discuss the automatic segmentation and type identification of a compound using simple statistics that results from the manually annotated data.
منابع مشابه
Formal Structure of Sanskrit Text: Requirements Analysis for a Mechanical Sanskrit Processor
We discuss the mathematical structure of various levels of representation of Sanskrit text in order to guide the design of computer aids aiming at useful processing of the digitalised Sanskrit corpus. Two main levels are identified, respectively called the linear and functional level. The design space of these two levels is sketched, and the computational implications of the main design choices...
متن کاملSemantic Processing of Compounds in Indian Languages
Compounds occur very frequently in Indian Languages. There are no strict orthographic conventions for compounds in modern Indian Languages. In this paper, Sanskrit compounding system is examined thoroughly and the insight gained from the Sanskrit grammar is applied for the analysis of compounds in Hindi and Marathi. It is interesting to note that compounding in Hindi deviates from that in Sansk...
متن کاملVaakkriti: Sanskrit Tokenizer
Machine Translation has evolved tremendously in the recent time and stood as center of research interest for many computer scientists. Developing a Machine Translation system for ancient languages is much more fascinating and challenging task. A detailed study of Sanskrit language reveals that its well-structured and finely organized grammar has affinity for automated translation systems. This ...
متن کاملA New Computational Schema for Euphonic Conjunctions in Sanskrit Processing
Automated language processing is central to the drive to enable facilitated referencing of increasingly available Sanskrit E-texts. The first step towards processing Sanskrit text involves the handling of Sanskrit compound words that are an integral part of Sanskrit texts. This firstly necessitates the processing of euphonic conjunctions or sandhi-s, which are points in words or between words, ...
متن کاملON KNOWLEDGE REPRESENTAnON USING SEMANTIC NETWORKS AND SANSKRIT
The similarity between the semantic network method of knowledge representation in artificial intelligence and shastric Sanskrit was recently pointed out by Briggs. As a step towards further research in this field, we give here an overview of semantic networks and natural-language understanding based on semantic networks. It is shown that linguistic case frames are necessary for semantic network...
متن کامل